Automatic media monitoring using stochastic pattern recognition techniques

Author

  • Uri Iurgel
Abstract

Information is of strategic importance for businesses and government agencies, but also for individual citizens. The use of automatic methods for the selection and dissemination of information would enable media monitoring companies to cover a much larger variety of media sources while working more cost-efficiently and providing 24-hour coverage and availability. This thesis investigates how professional media monitoring, which is currently a largely manual process, can be supported by automatic methods. Three main modules are necessary for automatic media monitoring: speech recognition, topic segmentation, and topic classification. The research conducted on these three topics and the resulting innovations are presented. The performance of the individual modules, as well as of the complete system, is thoroughly investigated. The focus of this thesis is German news. Topic boundaries are determined using a novel approach to visual indexing. A speech recogniser transforms the audio signals into text, which is then classified for the presence of predefined topics. For topic classification, approaches with Hidden Markov Models, Neural Networks, and Support Vector Machines (SVMs) are investigated. One contribution of this thesis is the introduction of novel couplers for SVMs with advantages over known couplers. An additional topic covered in this thesis is Unsupervised Topic Discovery, a field nearly neglected in the literature. It makes it possible to find keywords in texts without a predefined topic list or training samples.
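To illustrate what a coupler does, the following minimal sketch combines the outputs of pairwise (one-against-one) binary classifiers into a single probability estimate per topic. It is not one of the novel couplers introduced in the thesis, whose details the abstract does not give. It is a simple closed-form rule that recovers the class probabilities exactly when the pairwise estimates are mutually consistent. The function names, the topic labels, and the example numbers are all assumptions made for illustration.

```python
def couple_pairwise(r):
    """Combine pairwise probability estimates into per-class probabilities.

    r[i][j] (for i != j) is an estimate of P(class i | class is i or j),
    e.g. a calibrated output of the binary classifier "i versus j".
    Diagonal entries are ignored. Uses the closed form
        p_i = 1 / (sum_{j != i} 1 / r[i][j] - (k - 2)),
    followed by normalisation so that the results sum to 1.
    """
    k = len(r)
    eps = 1e-12  # guard against division by zero for estimates of exactly 0
    raw = []
    for i in range(k):
        inv_sum = sum(1.0 / max(r[i][j], eps) for j in range(k) if j != i)
        raw.append(1.0 / (inv_sum - (k - 2)))
    total = sum(raw)
    return [p / total for p in raw]


def pairwise_from_posteriors(p):
    """Build perfectly consistent pairwise estimates from known posteriors.

    Only used here to check the coupler: r[i][j] = p_i / (p_i + p_j).
    """
    k = len(p)
    return [[p[i] / (p[i] + p[j]) if i != j else 0.0 for j in range(k)]
            for i in range(k)]


# Hypothetical topics and posteriors, chosen only for illustration.
topics = ["politics", "sports", "weather"]
true_posteriors = [0.6, 0.3, 0.1]

r = pairwise_from_posteriors(true_posteriors)
estimated = couple_pairwise(r)
best_topic = topics[max(range(len(topics)), key=lambda i: estimated[i])]
```

With consistent pairwise inputs, `estimated` matches `true_posteriors` up to floating-point error. With real classifier outputs the pairwise estimates are generally inconsistent, which is where the choice of coupler starts to matter.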




Publication date: 2006